MAGPIE/EGRET annotation of the 2.9-Mb Drosophila melanogaster Adh region.
Abstract
Our challenge in annotating the 2.91-Mb Adh region of the Drosophila melanogaster genome was to identify genetic and genomic features automatically, completely, and precisely within a 6-week period. To do so, we augmented the MAGPIE microbial genome annotation system to handle eukaryotic genomic sequence data. The new configuration required the integration of eukaryotic gene-finding tools and DNA repeat tools into the automatic data collection module. It also required us to define in MAGPIE new strategies to combine data about eukaryotic exon predictions with functional data to refine the exon predictions. At the heart of the resulting new eukaryotic genome annotation system is a reverse comparison of public protein and complementary DNA sequences against the input genome to identify missing exons and to refine exon boundaries. The software modules that add eukaryotic genome annotation capability to MAGPIE are available as EGRET (Eukaryotic Genome Rapid Evaluation Tool).
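The reverse-comparison step described above can be illustrated with a minimal sketch. It is not the EGRET implementation: the function names, the interval representation, and the `tolerance` parameter are all hypothetical, and it assumes protein or cDNA hits have already been mapped to genome coordinates. Hits that overlap no predicted exon are reported as candidate missing exons, and predicted exon boundaries are snapped to a hit boundary when the two lie within the tolerance.

```python
from typing import List, Tuple

# (start, end) in 1-based inclusive genome coordinates; a hypothetical representation.
Interval = Tuple[int, int]


def overlaps(a: Interval, b: Interval) -> bool:
    """Return True if the two closed intervals share at least one position."""
    return a[0] <= b[1] and b[0] <= a[1]


def refine_exons(
    predicted: List[Interval],
    hits: List[Interval],
    tolerance: int = 10,  # assumed maximum boundary shift, in bases
) -> Tuple[List[Interval], List[Interval]]:
    """Refine predicted exons against homology hits and list unexplained hits.

    Returns (refined_exons, candidate_missing_exons).
    """
    refined = []
    for exon in predicted:
        start, end = exon
        for hit in hits:
            if not overlaps(exon, hit):
                continue
            # Snap each boundary independently if the hit boundary is close
            # to the originally predicted one.
            if abs(hit[0] - exon[0]) <= tolerance:
                start = hit[0]
            if abs(hit[1] - exon[1]) <= tolerance:
                end = hit[1]
        refined.append((start, end))

    # Hits with no overlapping prediction suggest exons the gene finder missed.
    missing = [h for h in hits if not any(overlaps(h, e) for e in predicted)]
    return refined, missing


if __name__ == "__main__":
    predicted = [(100, 200), (500, 600)]
    hits = [(105, 200), (300, 350), (500, 640)]
    refined, missing = refine_exons(predicted, hits)
    print("refined exons:", refined)
    print("candidate missing exons:", missing)
```

A real pipeline would also need to consider strand, reading frame, splice-site signals, and alignment scores, none of which this sketch models.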
Related resources
Drosophila genomic sequence annotation using the BLOCKS+ database.
A simple and general homology-based method for gene finding was applied to the 2.9-Mb Drosophila melanogaster Adh region, the target sequence of the Genome Annotation Assessment Project (GASP). Each strand of the entire sequence was used as query of the BLOCKS+ database of conserved regions of proteins. This led to functional assignments for more than one-third of the genes and two-thirds of th...
Molecular organization of the Drosophila melanogaster Adh chromosomal region in D. repleta and D. buzzatii, two distantly related species of the Drosophila subgenus.
The molecular organization of a 1.944-Mb chromosomal region of Drosophila melanogaster around the Adh locus has been analyzed in two repleta group species: D. repleta and D. buzzatii. The extensive genetic and molecular information about this region in D. melanogaster makes it a prime choice for comparative studies of genomic organization among distantly related species. A set of 26 P1 phages f...
Using GeneWise in the Drosophila annotation experiment.
The GeneWise method for combining gene prediction and homology searches was applied to the 2.9-Mb region from Drosophila melanogaster. The results from the Genome Annotation Assessment Project (GASP) showed that GeneWise provided reasonably accurate gene predictions. Further investigation indicates that many of the incorrect gene predictions from GeneWise were due to transposons with valid prot...
Application of a Time-delay Neural Network to Promoter Annotation in the Drosophila Melanogaster Genome
Computational methods for automated genome annotation are critical to understanding and interpreting the bewildering mass of genomic sequence data presently being generated and released. A neural network model of the structural and compositional properties of a eukaryotic core promoter region has been developed and its application for analysis of the Drosophila melanogaster genome is presented....
An exploration of the sequence of a 2.9-Mb region of the genome of Drosophila melanogaster: the Adh region.
A contiguous sequence of nearly 3 Mb from the genome of Drosophila melanogaster has been sequenced from a series of overlapping P1 and BAC clones. This region covers 69 chromosome polytene bands on chromosome arm 2L, including the genetically well-characterized "Adh region." A computational analysis of the sequence predicts 218 protein-coding genes, 11 tRNAs, and 17 transposable element sequenc...
Journal:
- Genome Research
Volume 10, Issue 4
Pages -
Publication date: 2000